Robust glottal closure detection using the wavelet transform
Abstract
In this work, a time-scale framework for the analysis of glottal closure instants is proposed. Because glottal closure can be soft or sharp, depending on the type of vocal activity, the analysis method must handle both wide-band and low-pass signals, which makes a multi-scale analysis well suited. The analysis is based on a dyadic wavelet filterbank. The amplitude maxima of the wavelet transform are computed at each scale and organized into lines of maximal amplitude (LOMA) using a dynamic programming algorithm. These lines form “trees” in the time-scale domain. Glottal closure instants are then interpreted as the top of the strongest branch, or trunk, of these trees. The amplitudes of the LOMA are also informative: the LOMA are strong and well organized for voiced speech, and rather weak and widespread for unvoiced speech. The accumulated amplitude along the LOMA therefore gives a very good measure of the degree of voicing.
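The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: the wavelet is a first derivative of a Gaussian, the scales are 2, 4, 8, the maxima are linked from coarse to fine scale with a greedy nearest-neighbour rule (a simpler stand-in for the dynamic programming algorithm mentioned above), and lines are kept when their accumulated amplitude exceeds a relative threshold. All function and parameter names (`dyadic_wavelet_transform`, `trace_loma`, `detect_gci`, `rel_threshold`) are hypothetical.

```python
import numpy as np


def dyadic_wavelet_transform(x, n_scales=3):
    """Filter x with a derivative-of-Gaussian wavelet at scales 2, 4, ..., 2**n_scales.

    Assumption: x is longer than the largest kernel (8 * 2**n_scales + 1 samples).
    """
    rows = []
    for j in range(1, n_scales + 1):
        s = 2.0 ** j
        t = np.arange(-int(4 * s), int(4 * s) + 1)
        psi = -t * np.exp(-t ** 2 / (2 * s ** 2))   # odd wavelet, one vanishing moment
        psi /= np.sum(np.abs(psi))                  # L1 normalisation across scales
        rows.append(np.convolve(x, psi, mode="same"))
    return np.array(rows)                           # shape: (n_scales, len(x))


def modulus_maxima(row):
    """Indices of the local maxima of |row| (first and last samples excluded)."""
    a = np.abs(row)
    return np.flatnonzero((a[1:-1] > a[:-2]) & (a[1:-1] >= a[2:])) + 1


def trace_loma(w):
    """Follow each coarse-scale maximum down to the finest scale.

    Greedy linking (assumption): at each finer scale, jump to the nearest
    maximum, and drop the line if it is more than twice that scale away.
    Returns (position at finest scale, accumulated amplitude) per line.
    """
    maxima = [modulus_maxima(r) for r in w]
    lines = []
    for start in maxima[-1]:
        pos, acc = start, abs(w[-1, start])
        for j in range(w.shape[0] - 2, -1, -1):
            cand = maxima[j]
            if cand.size == 0:
                break
            nearest = cand[np.argmin(np.abs(cand - pos))]
            if abs(nearest - pos) > 2 ** (j + 2):   # 2 * scale of row j
                break
            pos = nearest
            acc += abs(w[j, pos])
        else:                                       # line reached the finest scale
            lines.append((int(pos), float(acc)))
    return lines


def detect_gci(x, n_scales=3, rel_threshold=0.5):
    """Keep the lines whose accumulated amplitude is close to the strongest one."""
    lines = trace_loma(dyadic_wavelet_transform(x, n_scales))
    if not lines:
        return []
    peak = max(acc for _, acc in lines)
    return sorted({pos for pos, acc in lines if acc >= rel_threshold * peak})
```

As a usage example, a synthetic sawtooth such as `x = (np.arange(800) % 80) / 80.0` has an abrupt drop every 80 samples, and `detect_gci(x)` should return positions close to those drops. The accumulated amplitude returned by `trace_loma` is the quantity the abstract proposes as a voicing measure; the relative threshold used here to select strong lines is an arbitrary illustrative choice.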
Similar resources
Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima
This paper deals with the characterisation and detection of singularities in the electroglottogram (EGG) signal using wavelet transform modulus maxima. These singularities correspond to glottal opening and closure instants (GOIs and GCIs). Wavelets with one and two vanishing moments are applied to the EGG signal. We show that a wavelet with one vanishing moment is sufficient to detect singularities of EGG signa...
An automatic pitch-marking method using wavelet transform
This paper describes a new automatic pitch-marking method using the wavelet transform. The method detects the discontinuity in the speech waveform that occurs at the glottal closure instant (GCI). A time-domain prosodic modification technique requires an appropriate determination of the synthesis pitch-marks. We evaluated the performance of the newly developed pitch-marking method by using our interna...
The Improvement of the Wavelet Analysis Techniques by Using B-spline Functions Family
The goal of this paper is to justify the use of fractional B-spline bases in several Wavelet Analysis Techniques. We are interested in better controlling the differentiation behavior of the wavelet. For that purpose, we propose an edge detector algorithm by using the wavelet transform local maxima, based on B-spline functions. Another task is to search the connection available between energetic...
Phase-Based Methods for Voice Source Analysis
Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximu...
Glottal flow derivative modeling with the wavelet smoothed excitation
This paper discusses a method for estimating glottal flow derivative model parameters using the wavelet-smoothed excitation. The excitation is first estimated using the Weighted Recursive Least Squares with Variable Forgetting Factor algorithm. The raw excitation is then smoothed by applying a Discrete Wavelet Transform (DWT) using Biorthogonal Quadrature filters, and a thresholding operation d...